Picture for Saksham Suri

Saksham Suri

UPLiFT: Efficient Pixel-Dense Feature Upsampling with Local Attenders

Add code
Jan 25, 2026
Viaarxiv icon

VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice

Add code
Jan 08, 2026
Viaarxiv icon

EdgeTAM: On-Device Track Anything Model

Add code
Jan 13, 2025
Viaarxiv icon

Efficient Track Anything

Add code
Nov 28, 2024
Viaarxiv icon

VeriGraph: Scene Graphs for Execution Verifiable Robot Planning

Add code
Nov 15, 2024
Figure 1 for VeriGraph: Scene Graphs for Execution Verifiable Robot Planning
Figure 2 for VeriGraph: Scene Graphs for Execution Verifiable Robot Planning
Figure 3 for VeriGraph: Scene Graphs for Execution Verifiable Robot Planning
Figure 4 for VeriGraph: Scene Graphs for Execution Verifiable Robot Planning
Viaarxiv icon

LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior

Add code
Oct 28, 2024
Viaarxiv icon

UVIS: Unsupervised Video Instance Segmentation

Add code
Jun 11, 2024
Figure 1 for UVIS: Unsupervised Video Instance Segmentation
Figure 2 for UVIS: Unsupervised Video Instance Segmentation
Figure 3 for UVIS: Unsupervised Video Instance Segmentation
Figure 4 for UVIS: Unsupervised Video Instance Segmentation
Viaarxiv icon

LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT Descriptors

Add code
Mar 21, 2024
Viaarxiv icon

Gen2Det: Generate to Detect

Add code
Dec 07, 2023
Viaarxiv icon

Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization

Add code
Aug 18, 2023
Viaarxiv icon